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Abstract 

Coloring graphs is an important algorithmic problem in combinatorics with many applications in 
computer science. In this paper we study coloring tournaments. A chromatic number of a ran¬ 
dom tournament is of order ^( i 0 g( n ) )• The question arises whether the chromatic number can be 
proven to be smaller for more structured nontrivial classes of tournaments. We analyze the class of 
tournaments defined by a forbidden subtournament H. This paper gives a first quasi-polynomial 
algorithm running in time e°^ log ^ n ^ ^ that constructs colorings of H-free tournaments using only 
0(n l ~ e ^ H ^ log(n)) colors, where e(H) > 2 -250|W +1 for many forbidden tournaments H. To the 
best of our knowledge all previously known related results required at least sub-exponential time 
and relied on the regularity lemma. Since we do not use the regularity lemma, we obtain the 
first known lower bounds on e(H) that can be given by a closed-form expression. As a corollary, 
we give a constructive proof of the celebrated open Erdos-Hajnal conjecture with explicitly given 
lower bounds on the EH coefficients for all classes of prime tournaments for which the conjecture is 
known. Such a constractive proof was not known before. Thus we significantly reduce the gap be¬ 
tween best lower and upper bounds on the EH coefficients from the conjecture for all known prime 
tournaments that satisfy it. We also briefly explain how our methods may be used for coloring 
H -free tournaments under the following conditions: H is any tournament with < 5 vertices or: H 
is any but one tournament of six vertices. 
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1 Introduction 

Let | • | to denote the size of the set. Let G be a graph. We denote by V(G) the set of its vertices 
and by E(G) the set of its edges. Sometimes instead of writing |K(G)| we use the shorter notation 
|G|. We call |G| the size of G. For a subset S C V{G) we denote by G\S the subgraph of G induced 
by S. A clique in an undirected graph is a set of pairwise adjacent vertices. An independent set 
in the undirected graph is a set of pairwise nonadjacent vertices. All logarithms used in this paper 
are of base 2. 

A tournament is a directed graph such that, for every pair v and w of vertices, exactly one of the 
edges (■v , w) or (w, v ) exists. If (v, w ) is an edge in the tournament then we say that v is adjacent to 
w and w is adjacent from v. The indegree of a vertex v of a tournament T is the number of vertices 
w G V(T) such that (w,v) G E(T). Similarly, the outdegree of a vertex v of a tournament T is 
the number of vertices w G V(T) such that (v,w) G E(T). A directed cycle is a set of vertices 
{uo,..., Vk-i} for some k > 3 such that (uj, U( i+1 ) mod fc) is a directed edge for i = 0— 1. A 
tournament is transitive if it contains no directed cycle. For the set of vertices V = {v\,V 2 , ..., Vk} 
we say that an ordering (iq, V 2 , ■■■, Vk) is transitive if v\ is adjacent to all other vertices of V, V 2 
is adjacent to all other vertices of V but v±, etc. A subset S C V(T) is transitive if it induces a 
transitive tournament. For a tournament H we say that a tournament T is H- free if T does not 
contain H as an induced subtournament. 

A proper coloring of a tournament T is an assignment of colors to its vertices such that there 
is no directed monochromatic cycle. Equivalently, we can consider a proper coloring of a hyper¬ 
graph, where the set of vertices is V (T) and the set of hyperedges consists of all triples of vertices 
inducing directed triangles in T. When properly coloring a hypergraph, we do not want to create 
monochromatic hyperdges. The latter is equivalent to our previous definition of coloring since one 
can easily note that if a monochromatic directed cycle exists in T then a monochromatic directed 
triangle exists as well. The chromatic number of a tournament T is the minimal number of colors 
needed to properly color T. Note that under every proper coloring of the vertices of T each color 
class induces a transitive subtournament. Let Tf be a random n- vertex tournament, where inde¬ 
pendently for every pair of vertices {u, v}. we have: (u, v ) G E(T) with probability It is not hard 
to prove that with probability tending to 1 as n —> oo the largest transitive subtournaments of Tf 
are of logarithmic size. Thus, according to the remark above, the chromatic number of a random 
tournament is )• 

A celebrated unresolved conjecture of Erdos and Hajnal states that: 

Conjecture 1.1 For every tournament H there exists e(H) > 0 such that every n-vertex H-free 
tournament contains a transitive subtournament of size at least n e ( H ' ) . 

In fact the conjecture was first proposed in the undirected setting by Erdos and Hajnal but was 
proven to be equivalent to the directed setting above by Alon, Pach and Solymosi in 2001 (see: 
HD- The undirected version of the conjecture (see: ®) states that: 

Conjecture 1.2 For every undirected graph H there exists e{H) > 0 such that every n-vertex 
graph G that does not contain H as an induced subgraph contains a clique or an independent set of 
size at least n e ^. 

If for a given tournament H there exists e{H) > 0 then we say that H satisfies the Erdos-Flajnal 
conjecture with e{H) or simply: H satisfies the Erdos-Flajnal conjecture. The coefficient e(H ) in 
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the statement is called the EH coefficient. From now on instead of saying: ’’the conjecture of 
Erdos and Hajnal” we will simply say: ’’the conjecture”. From the context it will be always clear 
whether we have in mind a directed or an undirected version. 

A subset of vertices S C V(H) of a tournament H is called homogeneous if for every v £ V ( H)\S 
the following holds: either V w& s{ui,v) £ E(H) or V we s(v,w) £ E(H). A homogeneous set S is 
called nontrivial if |<Sj > 1 and S 7 ^ V(H). A tournament is called prime if it does not have 
nontrivial homogeneous sets. 

We call by C 5 a unique tournament on five vertices, where each vertex has indegree and outde- 
gree two. 

2 Main results and related work 

Now we summarize our main results. We present a family of tournaments called constellations and 
show a quasi-polynomial algorithm running in time e^ log ^ n ^ that constructs a proper coloring of 
an H- free tournament T, where H is a constellation, with 0(n 1-e ^) log(n)) colors, where e(H) = 
2 _ 2 5 °I w I + 1 _ yyg p rove that: 

2.1 If H is a constellation then every n-vertex H-free tournament may be properly colored with 
n i-e(H) log(n) colors, where e(H ) = , ) 50 *a +1 and h = \H\. Furthermore, every H-free tournament 

T contains a transitive subtournament of order at least \T\ e ( H h 

Constellations play important role in the conjecture since all known prime tournaments with 
more than six vertices satisfying the conjecture are constellations. Furthermore all tournaments for 
which the conjecture has been proven so far can be obtained from an infinite family of constellations 
and three other tournaments after applying the so-called substitution procedure introduced in [I]. 
Prime tournaments are important since if the conjecture is true for prime tournaments then it is 
true in general. All tournament apart from a tournament C 5 and some two six-vertex tournaments, 
considered in papers such as: m, are special examples of tournaments that can be obtained 

from constellations using the substitution procedure. We will briefly explain how, as a byproduct 
of our techniques, one can color H-free tournaments with 0 (n 1-e ) colors for every tournament 
H on at most 5 vertices and all but one tournament H on 6 vertices. Our techniques give a 
constructive proof of the Erdos-Hajnal conjecture for all the constellations. Besides, after combining 
our methods with the substitution procedure, we obtain explicit lower bounds on EH coefficients 
for all known tournaments satisfying the conjecture. The family of constellations was introduced 
in [?]. In the same paper the conjecture was proven for them. However no algorithm to construct 
the 0(n 1_e )-coloring efficiently was given. Furthermore, even though that paper showed that an 
optimal coloring uses (^(n 1 ^ 6 ) colors, the constant e > 0 was extremely small since the proof heavily 
relied on the regularity lemma. In this paper we show that the regularity lemma is not needed at 
all and thus we obtain much better bounds. Our main contribution is an algorithmic proof that 
does not use the regularity lemma and a general method that can be used to show first explicit 
lower bounds on EH coefficients for all known tournaments satisfying the conjecture. This leads to 
better understanding of the asymptotics of EH coefficients. All previously known positive results 
regarding the conjecture for prime tournaments needed the regularity lemma. That implied big 
gaps between best known upper bounds on e(H) of the order ^ and best known lower bounds 
that were inversely proportional to the Szemeredi tower function. In this paper we significantly 
reduce this gap. In their original paper Erdos and Hajnal asked how the EH coefficient depends on 
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graph parameters such as its size. Our paper provides a step towards an answer for prime graphs. 
Previous results concerning upper bounds for EH coefficients of prime graphs were given (see: [3], 
0 ) but these lacked explicit lower bounds. This paper attempts to fill the gap. 

Our techniques may be easily adapted to other problems regarding coloring classes of tourna¬ 
ments defined by forbidden subtournaments. 

Let us end this section by briefly summarizing recent progress regarding the directed version 
of the conjecture. [6j described all tournaments satisfying the conjecture in the strongest, linear 
sense. Similarly, all tournaments satisfying the conjecture in an almost linear sense (so-called 
pseudocelebrities ) were described by [7]. However, both results are for tournaments that are not 
prime. In [5] and [4] several results regarding the conjecture for prime tournaments were proven. 
All the previous positive results were of purely theoretical flavor and did not easily translate into 
algorithmic results. 

The paper is organized as follows: 

• In Section 3 we formally define the family of constellations and introduce other important 
definitions. 

• In Section 4 we give an algorithm to color H -free tournaments for all constellations H. 

• In Section 5 we prove correctness and analyze the running time of the presented algorithm 
and therefore prove Theorem 12.11 

• In Section 6 we discuss some further applications of the introduced techniques. 


3 Constellations 

In this section we define the family of constellations and introduce other important definitions that 
will be used in further analysis. 

Fix some ordering of vertices of a tournament H. An edge {v, w ) under this ordering is called a 
backward edge if w precedes v in this ordering. Let T be a tournament with vertex set V ( T ) and fix 
some ordering of its vertices. The graph of backward edges under this ordering, denoted by B{T, 6 ), 
has vertex set V(T), and {vi,Vj} £ E(B(T,6)) if and only if ( Vi,Vj) or ( Vj,V {) is a backward edge 
of T under the ordering 9. For an integer t, we call the graph K\j a star. Let S be a star with 
vertex set {c,..., It}, where c is adjacent to vertices l\,... ,l t . We call c the center of the star, 
and l\,... ,lt the leaves of the star. Note that in the case t = 1 we may choose arbitrarily any one 
of the two vertices to be the center of the star, and the other vertex is then considered to be the 
leaf. Let 9 = (v\,V 2 , ■■■,v n ) be an ordering of the vertex set V(T) of a n-vertex tournament T. For 
a subset S C V(T) we say that Vi £ S is a left point of S under 9 if i = rnin{j : Vj £ S'}. We say 
that Vi £ S is a right point of S under 9 if i = max{j : Vj £ S}. If from the context it is clear 
which ordering is taken we simply say: left point of S or right point of S. For an ordering 9 and two 
vertices v; t , v 3 with i ^ j we say that Vi is before Vj if i < j and after Vj otherwise. We say that a 
vertex Vj is between two vertices Vi,Vk under an ordering 9 = (iq,..., v n ) if i < j < k or k < j < i. 

A right star in B(T, 9) is an induced subgraph with vertex set {vi 0 , ... ,Vi t }, such that 
B(T,9) IK,..., Vi t } is a star with center Vi t , and it > io, - ■ ■ ,it- 1 - In this case we also say that 
{vi 0 , ... ,Vi t } is a right star in T. A left star in B(T,9) is an induced subgraph with vertex set 
{vi 0 , ..., v ; H }, such that B(T, 9)\{vi 0 , ..., v H } is a star with center Vi 0 , and io < i\,... ,i t . In this 
case we also say that {vi 0 , ..., Vi t } is a left star in T. From now on whenever we will refer to the 
star in B(T, 9) we will mean a left star or a right star. 
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Let H be a tournament and assume there is an ordering 9 of its vertices such that every 
connected component of B(H,9 ) is either a star or a singleton under this ordering. We call this 

ordering a star ordering. The interstellar graph of H under a star ordering 6 is an undirected graph, 

whose vertices are the sets of leaves of the stars of H under 6 and any two given vertices L\ and 
L -2 are adjacent iff: 

• the left point of L\ precedes the right point of L 2 in 6 and 

• the left point of L 2 precedes the right point of L\ in 6. 




Fig.l Constellation with backward edges drawn under its constellation ordering and consisting of 
three stars: £ 1 , £ 2 , £3 (above). The interstellar graph where nodes correspond to the stars and 
two edges are adjacent if and only if the intervals defined by the sets of leaves under given 

constellation ordering intersect (below). 

For each connected component C of the interstellar graph of H denote by Z(C) the union of 
subsets of V(H) corresponding to its vertices (this is the union of some subsets of V(H) of the 
vertices of H). Next let us define C(Z(C)) as follows. We say that a vertex v G C(Z(C)) if v G Z(C) 
or v is between some two vertices of Z(C) under the ordering 9. Let C\,...,Ck be the connected 
components of the interstellar graph. Note that for any given 1 < i < j < k either every vertex 
of C(Z(Ci )) is before every vertex of C(Z(Cj)), or every vertex of C(Z(Cj)) is before every vertex 
of C(Z(Ci)). Thus there is a natural ordering of the sets C(Z(Ci)) for i = 1,2,..., k induced by 
the ordering of the vertices. Denote the ordered sequence of the sets C(Z(Ci)) for i = 1, 2,..., k as 
(Wi,...,, Wfc), where a set W) is before a set Wj for 1 < i < j < k. Denote Wo = Wfc +1 = 0- For 
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i = 1,2, ...,k + 1 denote by IZi the set of the vertices of H that are after all the vertices of W*_i 
and before all the vertices of W,; under the ordering 9. Note that if IZi is nonempty then all its 
elements are centers of the stars of H. Denote the set of nonempty sets IZi as {.Adi, •••, A 4 r } for 
some r > 0. Note that {Wi,Wfc, ..., M. r } is a partition of the vertices of H. Denote this 
partition by Pg(H). We are ready to define constellations. 

A tournament T is a constellation if there exists a star ordering 9 of its vertices such that if a 
center of a star belongs to some set P € Pg(H ) then no leaf of this star belongs to P. 

We call such an ordering a constellation ordering of T. Let be the non-singleton 

components of B(T, 9). We say that Siare the stars of T under 9. If V ( T ) = (J* =1 V (S i), 
we say that T is a regular constellation. 

Even though the definition of the family of constellations that we have just presented seems to 
be complicated, it is in fact easy to construct examples of constellations of an arbitrary size. That 
is because our definition uses the notion of the constellation ordering and this term has natural 
and straightforward pictorial interpretation. 



Fig.2 Constellation consisting of two stars - one left and one right. For the clarity of the picture 

only backward edges were drawn. 

A galaxy ordering of the vertices of the tournament is the constellation ordering under which 
no center of the star appears between leaves of another star. Notice that this is not necessarily the 
case for the constellation ordering. A center of the star can be between leaves (call this set C) of 
another star, but if it happens then all its leaves have to be in a different connected component 
of the interstellar graph that the one that corresponds to C. A galaxy is a tournament that has a 
galaxy ordering of vertices. Galaxies is a subfamily of constellations. The conjecture was proven 
for them in [5]. That was the first result where the conjecture was proven for an infinite family of 
prime tournaments. Our constructive proof gives much better lower bounds on EH coefficients for 
galaxies than those from j5j. 



Fig.3 Galaxy consisting of one left and two right stars. All edges that are not drawn are forward. 
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3.1 Some useful definitions 


This section provides several definitions used in the paper. 

Take a tournament T. Let X,Y C V(T) be disjoint, where |AT|, |T| > 0. Denote by ex,Y the 
number of directed edges (x,y), where x € X and y € Y. The directed density from X to Y is 
defined as d{X,Y) = p^. 

We say that a tournament T is e-transitive if it contains a transitive subtournament of order at 
least \T\ € . 

For the transitive pairwise disjoint subsets Ti,T 2 , T\ C V(T) we say that a sequence 

(Ti, T 2 , ..., Tfc) is a (c, A, e)-t-sequence of length k (where t stands for: transitive ) if the following 
holds: 


• d(Ti,Tj) > 1 — A for 1 < i < j < k, and 

• | Ti\ > c\T\ e for i = 1, 2, k. 

For the transitive pairwise disjoint subsets 7j, T 2 ,C V(T) we say that a sequence 
(Tj, T 2 , Tj,.) is a smooth (c, A, e)-t-sequence of length k if the following holds: 

• d({u}, Tj) > 1 — A for 1 < i < j < k and v € Tj, 

• d(T t , {u}) >1 — Aforl<i<j<fc and v £ Tj, and 

• |Tj| > c|T| e for i = 1, 2,..., k. 

Note that every smooth (c, A, e)-t-sequence is a (c, A, e)-t-sequence. 

For the pairwise disjoint subsets C V(T) we say that a sequence (^i, A 2 , ■■■, A},) is a 

(c, X)-l-sequence of length k (where l stands for: linear) if the following holds: 

• d(Ai, Aj ) > 1 — A for 1 < i < j < k, and 

• \Ai\ > c\T\ for i = 1,2,..., k. 

For the pairwise disjoint subsets A\,...,A^ C V(T) we say that a sequence (A\, A 2 ,A^) is a 
smooth (c, A )-l-sequence of length k if the following holds: 

• d({v}, Aj) > 1 — A for 1 < i < j < k and v € Ai, 

• d(Aj, {u}) > 1 — A for 1 < i < j < k and v € Aj, and 

• |-Ai| > c|T| for i = 1, 2,..., k. 

Note that every smooth (c, A)-/-sequence is a (c, A)4-sequence. 

For the pairwise disjoint subsets Ai, ..., Af- + i, Ti,..., Tk C V(T), where Ti,...,Tfc are transitive, 
we say that a sequence (A\, T\, ..., A^, T^, A^ + i) is a (c, A, e)-m-sequence of length 2k + 1 (where m 
stands for: mixed) if the following holds: 

• d(Ti,Tj) > 1 — A for 1 < i < j < k, 

• d(Ai , Aj) > 1 — A for 1 < i < j < k, 

• d(Ai,Tj) > 1 — A for 1 < i < j < k, 

• d(Ti,Aj) > 1 — A for 1 < i < j < k, 
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• |-Ai| > c|T| for i = 1, 2, k,k + 1, and 

• |T| > c\T\ e for i = 1, 2, k. 

We refer to the sets T\. from the m-sequence as transitive sets of the m-sequence. We say 
that a m-sequence of length 2k + 1 is M-hig if Tj > M for i = 1, 2, k. 



Fig.4 Schematical representation of the (c, A, e)-m-sequence. This sequence consists of three linear 
sets: A\, A 2 , A 3 and two transitive sets: T\ and T 2 . The arrows indicate the orientation of most of 

the edges going between different elements of the (c, A, e)-m-sequence. Each Tj satisfies: |Tj| > cn e 

and each A t satisfies: |Aj| > c ■ n, where n = |T|. 

For the pairwise disjoint subsets A\, ..., A*. +1 , Ti,Tj. C V(T), where Ti,..., T^ are transitive, 
we say that a sequence (Ai, Ti,..., A&,T*,, Afc+i) is a smooth (c, A, e)-m-sequence of length 2k + 1 if 
the following holds: 

• sequence (Ti, Tyf) is a smooth (c, A, e)-t-sequence 

• sequence (A 1 ; Afc +1 ) is a smooth (c, A)-/-sequence 

• d({v}, Tj) > 1 — A for 1 < i < j < k and v G Ai, 

• d(Ai , {u}) > 1 — A for 1 < i < j < k and v G Tj, 

• d({v}, Aj) > 1 — A for 1 < i < j < k and v G T, 

• d(Ti, {u}) >1 — Aforl<i<j</c and v G Aj, 

• |Aj| > c|T| for i = 1, 2, k, k + 1, and 

• |Tj| > c|T| e for i = 1,2, k. 

Note that every smooth (c, A, e)-m-sequence is a (c, A, e)-m-sequence. 

If a smooth (c, A, e)-m-sequence (Ai, Ti, A^, T^, A^ +1 ) satisfies: 

• d(T{,Tj) = 1 for i,j G T, where i < j and Z C {1,2, 

then we say that (Ai, Ti,..., A*,, Tfc, A^+i) is an 1-strong (c, A, e)-m-sequence of length 2k + 1. 

Whenever we do not care about parameters of the (c, A, e)-f-sequences, (c, A)4-sequences or 
(c, A, e)-m-sequences under consideration, we refer to them simply as: f-sequences, /-sequences and 
m-sequences respectively. 

For two disjoint subsets Ai,Ti C V(T) such that T\ is transitive we say that a pair (Ai,Ti) is 
(c, e) -saturated if the following holds: 

• |Ai| > c|T|, 

• |Ti | > c|T| e , and 

• d(Ai,Ti) = 1 or d(T\,Ai) = 1. 
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4 Quasi-polynomial algorithm for coloring H- free tournaments for 
a constellation H 

4.1 Introduction 

Whenever we consider algorithms involving tournaments, we assume that the input is a tourna¬ 
ment description given by adjacency lists. This section presents two main algorithms (and several 
subroutines used by them): one that finds a polynomial-size transitive subtournament in the strong 
m-sequence of an H-free tournament (algorithm PolyTrans ) and one that colors an iL-free tourna¬ 
ment (algorithm Color-H-free), where H is a constellation. The former is in fact used to construct 
the coloring produced by the latter one. The latter one takes as an input only forbidden constel¬ 
lation H and H- free tournament T. The core part of the algorithm PolyTrans is the recursive 
algorithm PolyTransCore that gets as an input colored m-sequence (the coloring is done in the 
initial phase of the algorithm PolyTrans ) and outputs a polynomial-size transitive subset. 

We would like to give a general idea of our algorithmic approach first since the algorithm is 
complicated. Assuming that T is an H- free tournament, we find in T a long enough sequence of 
big linear and transitive sets such that most of the edges between those sets go from these sets that 
are earlier in the sequence to those that are placed later. At that point no assumption about the 
structure of H is required. We handle that part of the algorithm without the use of the regularity 
lemma and that enables us to significantly improve best known lower bounds on e(H). In the 
second stage we heavily use the fact that H, as a constellation, has a specific ordering of vertices. 
The algorithm finding polynomial-size transitive subtournament uses the following technical proce¬ 
dures: Find-L-Sequence, Find-Clique, Find-M-Sequence, MakeSmooth, FindStrong-M-Sequence. Al¬ 
gorithm Find-L-Sequence finds a (c, A)-Lsequence in an H- free tournament. Algorithm Find-Clique 
finds a clique in a dense fc-partite undirected graph. Algorithm Find-M-Sequence is responsible for 
finding m-sequences in iL-free tournaments. Algorithm MakeSmooth makes them smooth. Finally, 
algorithm FindStrong-M-Sequence constructs a strong m-sequence in an iL-free tournament. That 
sequence is the input for the PolyTrans algorithm. This short summary will be clearer later when 
we describe all the algorithms in detail. 

Let H be a constellation with h = \H\. Let 8 be a constellation ordering of H. We define a 
function ( : V(ff) -> N as follows: 

• if v £ V(H) is a leaf of a star let £(v) = i(2h + 1) + 1, where v is the ith leaf under ordering 

e, 

• otherwise let C(v) = j(2h + 1) + 2r, where v is the ?’th center after the jth leaf and before the 
(j + l)th leaf under the ordering 8. 

First we show algorithms: PolyTrans and Color-H-free. Then we will describe all supportive 
procedures mentioned above. 

4.2 An overview of the method 

The constructive proof we are about to present that all the constellations satisfy the conjecture is 
very technical. Therefore before going into details we would like to explain main steps of the proof. 
In this subsection we give reader the intuition how the proof works, what are the most important 
parts of the proof, finally - why the ideas used to prove the conjecture for galaxies are not sufficient 
to succeed with constellations and what are the new techniques that need to be used in this setting. 

Notice first that we can always assume that we have a sequence of linear sets and big transitive 
subtournaments such that for any two of them the one that appears first in the sequence is almost 



complete to the one that appears later. This sequence is what we call a (c, A, e)-m-sequence. Pa¬ 
rameter c encodes lower bounds on the sizes of linear and transitive elements. Parameter e specifies 
the value of the exponent in the lower bound on the size of the transitive element. Finally, param¬ 
eter A specifies lower bound on the directed density between different elements of the sequence. By 
’’almost adjacent” we mean that the directed density is very close to 1. Existence of such a sequence 
is an immediate consequence of the regularity lemma, but since we do not want to use that tool, 
we prove that fact in a very different way. This enables us to get a bound on the EH coefficient 
that can be expressed in the compact way. What is important here is that at this point we did not 
need to assume any specific structure of the tournament H that is being excluded. If we need to 
summarize the proof of the conjecture for galaxies from [5] in one sentence, we should say: ’’linear 
sets for centers of stars, transitive subtournaments for leaves”. We give a proof by contradiction, 
assume that an H -free tournament does not have polynomial-size transitive subtournaments and 
construct a copy of H in it. The copy will be constructed star by star. We proceed by creating first 
an appropriate (c, A, e)-?n-sequence mentioned above. Then we are looking for stars with centers 
in linear sets and leaves in transitive sets. When the star is found it is removed and the entire 
sequence is updated. The update is done in such a wat that we can simply merge a star we have 
just found with the remaining part of the tournament H (that will be found in the new sequence) 
to get a copy of H. 



Fig.5 The method to construct polynomial-size transitive subtournament. A circle represents a 
linear set and {uq, ...,ws} is a big transitive subset that (by induction) can be extracted from it. 
A set {ui, ...,tt 3 } represents another transitive set of substantial size. Set {uq, ..., 103 } is complete 
to {tti, ..., 113 }. Thus {uq, ...,W 3 ,ui, ..., 143 } is also a transitive set. For e > 0 small enough this set 
is of size at least n e , where n is the size of the H -free tournament T. 

If the star cannot be found, then a simple argument using Pigeongole principle shows that we 
must have a substantial transitive subset complete to/from a linear set (see: Fig. 5). The key 
observation here is that a transitive set of the substantial size complete to or from a linear set 
gives us a polynomial-size transitive subtournament. The crucial element that makes this method 
work is that whenever we are looking for a star there is no need to take care of the right type 
of adjacency between leaves if all candidates for them were chosen from the same transitive set. 
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The right type of adjacency is given for granted. The price that is paid is the fact that we cannot 
allow centers of stars to be between leaves of another star since that would require looking for 
centers also in transitive chunks. That in turn will not enable us to get a polynomial-size transitive 
subtournament. This was also the main reason why the authors started to work on more general 
techniques that would handle cases where centers of stars are between leaves of another stars, i.e. 
more general configurations of stars. The conceptual idea of this more general technique is to 
change the paradigm: ” centers in linear sets, leaves in transitive subtournaments” by starting with 
the (c, A, u>)-t-sequence that consists only of big transitive chunks (by big we mean of size at least 
c • \T\ e for some constant c, e > 0). 

We start proceeding as in the galaxy-proof, but try to use f-sequences first (in particular we 
look for centers of stars in transitive chunks, see Fig. 6). 




Fig. 6 Constellation H with its constellation ordering of vertices (above) and the associated 
f-sequence y. Sequence x consists of three transitive sets, first two are partitioned into three 
equal-length subsets. There is a 1-1 mapping between vertices of H and subchunks TVj. The goal 

is to look for a node Vj in TV*. 

The problem we face (that was mentioned by us before) is that now we do not necessarily get 
as an outcome a big linear set complete to or from a big transitive chunk. Instead, we obtain two 
transitive chunks such that one of them is complete to the other one. However those chunks, even 
after merging, may not give big enough transitive subtournament. This is the place where we need 
to use strong (c, A, e)-m-sequences. 

We repeat our previous procedure several times in many different regions of the long enough 
t-sequence. By doing it, using Ramsey argument, we can conclude that we get arbitrarily large 
set of transitive subchunks of the elements of the sequence with the additional property that the 
subchunks appearing earlier in the sequence are complete to those appearing later (altogether they 
may still not give a big enough transitive subtournament). We may also assume without loss of 
generality that between those subchunks we have big linear sets such that each linear set is almost 
complete from all subchunks preceeding it and almost complete to all subchunks that it preceeds. 
This can be easily done if we slightly enrich the t-sequence we started with by introducing linear 
sets between transitive chunks. This can be always achieved since we have already observed that 
we can start with the arbitrarily long (c, A, e)-m-sequence. But now we can again try to build a 
constellation star by star. Notice however that in contrast to the previous scenario, we can look for 
different leaves of the same star in different transitive chunks (see Fig. 7). This is possible because 
for any two transitive chunks of the strong sequence the one appearing earlier in the sequence is 
complete to the one appearing later. Therefore while constructing a star in such a way that leaves 
are being found in transitive chunks we have the right type of adjacency between those candidates 
for leaves for granted. This is no longer true if the structure we are given is no strong. 
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Fig. 7 Constellation H with its constellation ordering of vertices (above) and the associated 
section of the (c, A, e)-m-sequence x corresponding to H. This section of the sequence x consists 
of three transitive sets and three linear sets. The second transitive set is partitioned into two 
equal-length subsets. Centers of stars as well as singletons (u 2 , U 5 , U 7 ) are being looked for in the 
linear sets (A\, A 2 , A 3 ). Leaves of stars are being looked for in the transitive sets TV*. 

We cannot get a strong (c, A, ^-m-sequence immediately. It can be constructed if the first 
approach that we used to find a copy of H or a polynomial-size transitive subtournament fails. If 
at any stage of our analysis in the second part of the algorithm (when we operate on the strong 
m-sequence) some star cannot be constructed then we get a linear set complete to/from a big 
transitive chunk. But now that gives us polynomial-size transitive subtournament and the proof is 
completed. Note that even though we significantly relax the condition we put on the configurations 
of stars by introducing the family of constellations, we still cannot look for a center of the star 
and some of its leaves in the same transitive chunk. This is the case since obviously no backward 
edges can be found in a transitive chunk. Therefore there are still some limitations put on the 
configuration of stars. However, as mentioned before, there are much weaker than previously. 

We are ready now to go into technical details of our algorithm. 

4.3 Algorithm PolyTrans 

We start with the PolyTrans algorithm that takes as an input an Z-strong IV-big (c, A, e)-m-sequence 
of the iL-free tournament T and outputs a transitive subtournament of T of size at least \T\ e , where 

e = for d = i™ 2 "’ as long as ^ - gasfe- h ' n = ~ and N ~ 221k2 ■ Wliat is also 

given as a part of the input for PolyTrans is the procedure V which, for every subtournament Tg 

of T other than T, computes a transitive subtournament of Tg of order at least |Ts| e . Assume that 
V(H) = {1,2, ...,h} and that (1,2, is a constellation ordering of H. 

Algorithm 4.1 (Algorithm PolyTrans returning a transitive subtournament of a polynomial-size) 

• Input: An T-strong (c, A, e) -m-sequence of length k = 2 h 2 + Ah + 1 for T = {h + 1,2 h + 

2^2 

1 , ...,/r 2 + 1} in an H-free tournament T of order n > — c —. The sequence is N-big for 
N > 2 21h ' 2 . Tournament H is a constellation. It is assumed that: A < >25 {^- and e = 
for c = ■ Procedure V is given that for every subtournament Tg of T different than T 

computes a transitive subtournament of Tg of order at least |Ts| e . It is assumed that this 
procedure runs in time /r(|Ts|) for some given function h. 
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• Output: A transitive subtournament of T of size at least \T\ e . 

• Description: Initialize a to be the set of all the stars of H . We first run algorithm MakeSmooth 
(with f = 0 ) on the m.-sequence from the input to make the sequence smooth. 

Algorithm MakeSmooth outputs a smooth (§,4A, e)-m-sequence which we denote as: [C i, Ck) 
(see: description of MakeSmooth). We give colors to the vertices of (C\, ...,Ck) as follows: 

— if Ci = C av) for some v £ V(H) and i is even, i.e. Ci is transitive, then color the first 
\C -1 

L 3 ii vertices of Ci in the transitive ordering by v and the rest by h + 1 , 

— if Ci = C^r v \ for some v £ V(H ) and i is odd then color an arbitrary subset of 
vertices of Ci by v and the rest by h + 1 , 

— otherwise color all vertices of Ci by h + 1 . 

Then run algorithm PolyTransCore (see: below) on the colored smooth (|,4A, e)-m.-sequence 
described above for parameter £ = procedure V and a set of stars a. 

Algorithm 4.2 (Algorithm PolyTransCore) 


• Input: A (c, A, e)-m-sequence (C\,...,Ck) of the H-free tournament T, where vertices of CiS 
are colored by the set {1,2, ...,h + 1}, a set of stars a and parameter £ > 0. It is assumed 
that if there exists a vertex of Ci colored by some color j then at least £,\Ci\ vertices of Ci are 
colored by that color. Procedure V is given that for every subtournament Ts of T different 
than T computes a transitive subtournament of Ts of order at least |Ts| e . 

• Output: A transitive subtournament of T of size at least \T\ e . 

• Description: Take a star X* £ a. Without loss of generality assume that it is a left star. 
Denote by c its center and by y i, ...,y q its leaves. Let X be the set of vertices of C^( c ) that are 
colored by c and let T l be the set of vertices of C^ y .) that are colored by yi for i = 1,2, q. 
Assume first that there does not exist a vertex r £ X and vertices r i, ...,r q such that ri £ T l 
and t is adjacent from all r\,...,r q . Then, by the Pigeonhole Principle, at least I\X\ vertices 
of X, call this set C, are adjacent to all vertices of for some j* £ {1,2,..., q}. Then 
run procedure V on T\C and merge the transitive subtournament output by the procedure with 
T J * to get a bigger transitive subtournament T b . Output T b . Assume now that there exists a 
vertex r £ X and vertices ri, ...,r q such that r j £ T l for i = 1,2,..., q and r is adjacent from 
all r i,..., r q . Let D^ j be a subset of vertices from Ci, other than X, T 1 ,..., T q , colored by some 
fixed color j. Denote by N 0 1,3 C D % ^ the subset of Dij consisting of vertices of D^j adjacent 
from r if r is before D h] in the m-sequence and adjacent to r otherwise (the ordering in the 
m-sequence is induced by an ordering of the sets Ci and a transitive ordering within transitive 
parts of the m-sequence). Similarly, denote by N k 1,3 C D tJ for k = 1,2 ,...,q the subset of 
D t j consisting of vertices of D^j adjacent from r* if r* is before D^j in the m.-sequence and 
adjacent to r * otherwise. Denote N Di ^ = N Dt -i H^o where N D *o = f\-=i 2 q ^k 

run algorithm MakeSmooth with parameter f satisfying: (1 — /) = £(1 — ^r) on the given 

m-sequence {C\, ...,Ck), where we take: S( = N Di ’T We obtain new smooth m.-sequence that 
we denote as x- In this new m.-sequence the coloring is inherited from the old one. We delete 
X* from g. Now we rerun algorithm PolyTransCore on x with the updated set of stars and 
updated parameters c, A,£: c— > ^(1 — A —> y 2 ; £ —>• |(1 — ^r). 
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The naive implementation of PolyTrans algorithm runs in time h(( 1 — c)n) + poly(n), where 
poly(n ) is an expression polynomial in n. This is true since whenever procedure V is called it is 
run on a tournament of at most (1 — c)n vertices. The correctness of the algorithm PolyTrans is 
proven later. 


4.4 Algorithm Color-H-free 

We are ready to give our main algorithm that for a given constellation H with \H\ = h, properly 

^_ 1 

colors every H- free tournament T using at most |T| 2 250h ' 2+1 log(|T|) colors and runs in time 

e O(iog(n) ) .As a byproduct, the algorithm finds in T a transitive subtournament of order at least 

i 

at least |T| 2 250h2+1 . This tournament is the first color class constructed by the algorithm on the 
way to produce the entire coloring. 


Algorithm 4.3 (Algorithm Color-H-free coloring H-free tournaments, where H is a constellation) 


• Input: Constellation H and H-free tournament T. 

1 —— hz — 

• Output: Proper coloring of T that uses at most |T| 2250 +1 log(|T|) colors. 

• Description: Run algorithm Find-L-Sequence with A = and k = 2h + 3 to get a l- 
sequence C. Truncate the constructed l-sequence C by deleting its last element. Note that 
what we get is another l-sequence Cf. Give it as an input to algorithm Find-M-Sequence. 
As a procedure Sub needed by Find-M-Sequence (see the description of the algorithm in the 
next subsection) use algorithm Color-H-free itself (thus we use a recursive call of the algo¬ 
rithm Color-H-free on the smaller graph). Denote the output m-sequence by M. Truncate 
it (by deleting its last elements) to reduce its length to exactly 2 h+2 (h + 1) + 2h + 1. De¬ 
note the truncated m--sequence as Ai f . Now run on At 4 algorithm FindStrong-M-Sequence 
to get an T-strong m.-sequence M s . Give M s as an input to algorithm PolyTrans. Again, 
as a procedure V needed by FindStrong-M-Sequence use algorithm Color-H-free itself. As an 
output we obtain a transitive subtournament T\. This transitive tournament becomes a first 
element of the output of the algorithm Color-H-free (remember that Color-H-free outputs both: 
polynomial-size transitive subset and the coloring). The remaining element of the output is 
the coloring and we are about to give it. Color all vertices ofT\ by 1. Remove T\ from T and 
repeat the entire procedure to obtain a transitive subtournament T). Color all its vertices by 
2. Remove T -2 from T and keep repeating the procedure. Continue until the tournament you 
are left with is empty. By that time all the vertices of T were colored and one can easily note 
that this is a proper coloring of T. The color classes are the sets: V(T\), V(T 2 ),... Output 
that coloring. 

4.5 Technical algorithms 
4.5.1 Introduction 

In this section we present algorithms: Find-L-Sequence, Find-Clique, Find-M-Sequence, MakeSmooth, 
FindStrong-M-Sequence that serve as technical subroutines for algorithms introduced earlier. 
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4.5.2 Algorithm Find-L-Sequence 


For a given 0 < A < 1 and k > 0 let A* = ( 4 ^ 4k ) 2lh ~ l for i = 0,1, k. 

We start by describing Find-L-Sequence algorithm that calculates a (c, A)-/-sequence of length 2 k 
for some given k > 0 in the 14-free tournament T with: \T\ > — —where H is an arbitrary 

k x \n\k 

tournament with \H\ >2,V ( H ) = {iq, Vh}, and c = c(H , k, A) = 2 ^\H\ ik • The algorithm Find-L- 
Sequence uses subroutine Find-L-Sequence- Core which recursively runs algorithm Find-L-Sequence 
on the smaller input. 

Algorithm 4.4 (Algorithm Find-L-Sequence constructing a (c, A) -l-sequence in an H-free tourna¬ 
ment) 


• Input: (X,k, H,T), where T is H-free, \H\ > 2. 

, ^hk 

• Output: A (c, A )-l-sequence of length 2 k in H, where c = c(H , k, A) = 2fc ^ 2fc . 

• Description: If k = 0 then output V(T). Assume now that k > 0. Then choose in T 
arbitrarily h pairwise disjoint sets: Si,...,Sh, each of size LfpiJ? where n = |T|. Run subrou¬ 
tine Find-L-Sequence-Core with parameters: ( H , A, A^, k, H , T\S\ U ... U S^, S \,..., S^), where 

h = \H\. 

Subroutine 4.1 (Subroutine Find-L-Sequence-Core) 

• Input: (H, X,X k ,k,H r ,T r ,S tl ,...,St lHri ), where k > 0, \H\ > 1 , V(H r ) = {v tl , ...,v t]Hrl } for 
some vertices: v^, ...,v^ Hr ^ and T r is H-free. It is assumed that: S^,S^ H ^ are nonempty. 

• Output: Sequence of pairwise disjoint subsets ofV(T ) of length 2 k (see: Description below). 

• Description: If\H r \ = 1 then raise an exception (in the analysis of the algorithm we will 
show that the exception in fact will never be raised). Now assume that \H r \ > 1. Denote 
h r = | H r |. For a vertex v € Sq and an index j = 2,3,...,h r denote by Nj(v) the set of 
vertices of Stj adjacent from v if vt :j is adjacent from or adjacent to v if vt j is adjacent to 
vt x in H r . Calculate values |iVj(u)| for every v € Sq and j = 2, 3,..., h. 

If there exists a vertex v* G S^ such that \Nj(v*)\ > Afc|5 4j . | for every j G {2, 3,..., h r } then 
run recursively subroutine Find-L-Sequence-Core with parameters: 

(H, A, A fc) k, H r \{v tl }, T r | N 2 (v*) U ... U N h r(v*),N 2 {v *),..., N h r{v*)). 

If this is not the case then (by Pigeonhole Principle) at least hr t f 1 vertices v of S^ satisfy 
the following: there exists j* G {2,3,..., h r } such that |iVj*(u)| < Afc|iS't *|. Denote this set 
of vertices by W. Run algorithm Find-L-Sequence with parameters: (A, k — 1, H,T r \W) to 
get a l-sequence: (Ai,..., A 2 k-i). Run algorithm Find-L-Sequence with parameters: (A ,k — 
1 , H, T r \St jt ) to get a l-sequence: (A ± ,..., A 2k _ 1 ). Note that d(W, •S’q*) > 1 — A& or 
d(S tj «,W) > 1—Afc. If the former is true then outpid the sequence (Ai,..., A 2 k-i, A\,..., A' 2k _ 1 ). 
Otherwise output the sequence AL ok _ x , A\,..., A 2 k-i). 

Algorithm Find-L-Sequence runs in polynomial time. Its correctness will be proven later. Later 
we will also analyze its running time in more detail. 
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4.5.3 Algorithm Find-Clique 

For an undirected graph G and two nonempty disjoint sets: A,B C V(G) we define, by an analogy 
to the directed setting: d(A, B) = -ppjp where eA,B is the number of edges between A and B. 
Assume now that we have a fc-partite undirected graph with color classes: Vj,..., 14. Assume 
besides that for every 1 < i < j < k the following holds: d(Vj, Vj) > 1 — A, where: A < 2fc + lfc • Now 
we will present an algorithm Find-Clique that finds in this graph a clique: {vi,...,Vk} such that 
Vi G Vi for i = 1,2,..., k. 

Algorithm 4.5 (Algorithm Find-Clique finding a clique in a dense k-partite undirected graph) 

• Input: k-partite undirected graph with color classes: Vj,...,Vj; such that for every 1 < i < 
j < k the following holds: d(Vi, Vj) > 1 — A, where A satisfies: 0 < A < ^k+i k ■ 

• Output: A clique {r>i,..., v^} such that Vi G Vj for i = 1,2 ,...,k. 

• Description: If k = 1 then output an arbitrary vertex ofV\. Now assume that k > 1. Define 

W{ = \v G Vj : d({e}, Vi) < 1 — 2fcA} for i = 2, 3,..., k. For every i each set Wi may be easily 
computed. Having sets Wi, take a vertex v\ G Vj\(W 2 U ... U Wk). For every i = 2,3, ...,k 
compute Vj = Vj fl Nf, where Nf is the set of vertices adjacent to v\ in Vi. Run recursively 
algorithm Find-Clique on the (k — 1)-partite induced subgraph with color classes: V 2 ,...,V k , 
and update X: A —> jjz^kX) 7 ' ^ ou °^ a * n a se t { v 2 , Output the set {r:i, ...,Vk}. 

The algorithm clearly runs in poly(m&Xi = \ „ k |Vj|) time (note that k is a constant). Its correct¬ 
ness will proven later. 

4.5.4 Algorithm Find-M-Sequence 

Let T be a tournament with n vertices. Assume now that (A \,..., A 2 k+i) is a (c, A)-Z-sequence in 
T, where A < 4 ( 9 fc + f) 3 4 fc+a • Assume furthermore than there exists a subroutine Sub that for every 
i = 1,2, ...,k and for every subtournament T$ of A 2 i computes in time 0(h(\Ts\)) a transitive 
subtournament of Tg of size at least \Tg\ e for some given e > 0. Under these conditions we will 
show an algorithm Find-M-Sequence that computes in T a (c , A, e)-m-sequence of length 2k + 1 
for c = min((|) e , c) which in addition is (log(cn) — 2)-big. From the characteristic of the sequence 
we already know that the sequence is cn-big and asymptotically for large n this is a stronger 
property than being (log(era) — 2)-big. However for the corner cases for small n (that we will need 
for induction to prove that the algorithm produces a transitive subset in the iL-free tournament of 
the desired size) we will also need (log(era) — 2)-bigness property. 

Algorithm 4.6 (Algorithm Find-M-Sequence finding m-sequences) 

• Input: A (c, A) -l-sequence (A \,..., A 2 fc+i) in T for A < f or some parameter 

0 < A < 1 and a subroutine Sub that for every i = 1, 2,..., k and for every subtournament Tg 
of A^i computes in time 0(/i(|Ts|)) a transitive subtournament ofTg of size at least \Tg\ £ for 
some given e > 0. 

• Output: A (min((|) e , c), A, e)-m-sequence of length 2k + 1 in T which is (log(era) — 2 )-big. 
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• Description: Denote n = \T\. Let i £ 1,2 ,...,k. Compute a transitive subset Tr\ C A 2 i of 
size at least log(^) — 1 for i = 1 , 2 ,..., k. Such a subset always exists since \A 2 i\ > ^ (it. also 
can be efficiently computed, we will discuss this in more detail later). Now use subroutine 
Sub to compute a transitive subset Tr\ C A 2l of size |"(|) e n e ]. Remove from A 2 i the bigger 
subset from the set: {Tr\,Tr l 2 } and denote this bigger one by T{. Continue until the set you 
are left with is of size smaller than ^4^. Denote transitive subsets obtained in such a way as: 
T {,..., Tf.. Construct a (2k + 1 )-partite graph G with color classes: V \,..., V 2k +i as follows: 

- V 2 i + i = {A 2 i+i} for * = 1 , 2 ,..., k, 

- V 2i = {Tf,...,Tf} fori = 1,2 ,...,k, 

- make two vertices X £ V a , Y £ \4 for l<a<b<2k + l adjacent if d(X, Y) > 1 — A. 

Run algorithm Find-Clique on G to obtain a sequence (Yi, •••, Y 2k+ i) that induces a clique in 
G (Yi £ Vi for i = 1,2, ...,2k + l). Output (Y x , ...,Y 2k+ i). 

Denote A m = maxj = i i ... i 2 fc+i \A%\. The algorithm runs in time 0(h(A m ))n l ~ e + poly(n), where 
poly(n ) is a polynomial factor. Its correctness and running time will be proven later. 

4.5.5 Algorithm MakeSmooth 

Assume that we are given a sequence (C\, ...,C k ) that is either a (c, A, e)-m-sequence, a (c,X,e)-t- 
sequence or an (c, A (-/-sequence. Assume that S( C C\ for j = 1,2,..., s*. Assume furthermore that 
Si s are pairwise disjoint for j = 1,2,...,and \Sf\ > (1 — f)\Cj\ for i = 1,2,...,k, j = l,2,...,Sj, 
where 0 < / < 1 is some Hxed parameter. Denote L = s i + ... + s k - We will show an algorithm 
that extracts a subset from every S( and uses extracted subsets to construct a smooth (c , A , e)- 
m-sequence, a smooth (c , A , e)-f-sequence or a smooth (c , A )-/-sequence respectively, where c = 
|(1 — /) and A = Each element of the constructed sequence (m-sequence, t-sequence or 

/-sequence) is the union of the corresponding extracted subsets. The goal is to make an input 
sequence smooth by taking subsets, but in such a way that a significant fraction of elements from 
each part of the partition defined by sequences S( is used in the built smooth sequence. 

The naive implementation of the algorithm below clearly runs in polynomial time. Its correct¬ 
ness is proven in the next section. 


Algorithm 4.7 (Algorithm MakeSmooth) 


• Input: A (c,\,e)-m-sequence, a (c, A, e)-t-sequence or a (c, A) -l-sequence (C\, ..., C k ), a set 
of subsets S) fori = 1,2, ...,k, j = 1,2, ...,Sj such that Sf C Ci, 1511 > (1 — f)\Ci\ and S( 
are pairwise disjoint for i = 1,2, j = 1,2, ...,Sj and some 0 < / < 1. It is assumed 
that Si = 1 fori = 1 , 2 , ...,k if (C \,..., C k ) is a l-sequence, and Si = 1 fori = 1,3,5,... if 
(Ci,..., C k ) is a m-sequence (in other words, if s t > 1 then i corresponds to the transitive set 
in (C\,..., C k )). Denote L = s i + ... + s k . 

• Output: A smooth (c , , e) -m-sequence, a smooth (c , X',e)-t-sequence or a smooth (c , X')- 

l-sequence (C l , ...,C k ) respectively, where c = |(1 — /), A = jYzjjz an d C i C Uj = i 2 si^l 
for i = 1, 2,..., k. Besides we have: |C- fl Sf \ > ^(1 — /) for i = 1, 2,..., k, j = 1,2,..., s*. 
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• Description: Let us assume that (C±, ..., Ck) is a (c, A, e)-m-sequence. For two remaining 
cases the algorithm is completely analogous. For i,j G {1, 2, k}, i ^ j, t\ G {1,2,..., Sj}, 
t <2 G {1,2,..., Sj } denote by C Sj 1 the set of vertices of S '* 1 that: 

- are adjacent to at least (1 — ^ ] 2L ^ )| 6 'j 2 1 vertices of Sj 2 if i < j or 

— are adjacent from at least (1 — ^j^ )\Sj 2 \ vertices of S t2 if i > j. 


IWeCM’-rWo*. 

for i = 1,2,..., k and output (C 1 ,..., C k ). 


Cl£ fori = 1,2,...,/?, ti = 1,2,..., Si. Denote C[ = Ut=i,2,..., Si Cht 


4.5.6 Algorithm FindStrong-M-Sequence 

We will now show a technical algorithm that is fundamental for finding efficient coloring of an H- free 
tournament, where H is a constellation. Again as before, the naive implementation of the algorithm 
runs in a polynomial time. Its correctness is proven in the next section. The presented procedure 
is essentially a wrapper for the main algorithm FindStrong-M-Sequence-Main described next to it. 
This main algorithm operates on the smooth m-sequence that was obtained in the preprocessing 
performed in the initial phase of FindStrong-M-Sequence. The goal of the FindStrong-M-Sequence 
is to extract a strong m-sequence from an IL-free tournament, where H is a given constellation. 
The input to the procedure is a M- big m-sequence for an appropriate parameter M. 

Let us remind now some important terms regarding constellations. The interstellar graph of 
the constellation H under a star ordering 9 is an undirected graph, whose vertices are the sets of 
leaves of the stars of H under 9 and any two given vertices L± and L -2 are adjacent iff: 

• the left point of L\ precedes the right point of L 2 in 9 and 

• the left point of L 2 precedes the right point of L\ in 9. 

Take an interstellat graph of the constellation H. For each connected component C of the interstel¬ 
lar graph of Ft we denote by Z(C) the union of subsets of V{H) corresponding to its vertices (this 
is the union of some subsets of V(H) of the vertices of H). We define C(Z(C)) as follows. We say 

that a vertex v G C(Z(C )) if v G Z(C) or v is between some two vertices of Z(C) under the ordering 

9. Let C 1 ,..., Cf be the connected components of the interstellar graph. Note that for any given 
1 < i < j < k either every vertex of C{Z{Ci )) is before every vertex of C(Z(Cj)), or every vertex of 
C(Z(Cj)) is before every vertex of C(Z(Ci)). Thus there is a natural ordering of the sets C(Z(Ci)) 
for * = 1,2,..., k induced by the ordering of the vertices. Denote the ordered sequence of the sets 
C(Z(Ci)) for i = 1 , 2 ,..., k as (Wi,...,, V\4), where a set Wj is before a set Wj for 1 < i < j < k. 
Denote Wo = Wk +1 = 0- For i = 1, 2,..., k + 1 denote by lZ t the set of the vertices of H that are 
after all the vertices of Wj -1 and before all the vertices of Wj under the ordering 9. Note that if TZi 
is nonempty then all its elements are centers of the stars of H. Denote the set of nonempty sets IZi 
as {Mi,—,M r ] for some r > 0 . Note that {Wi,..., W&, M\,..., M r } is a partition of the vertices 
of H. We denote this partition by Pg(Ft). Each constellation satisfies the following: if some center 
of the star belongs to some P G Pg{H) then no leaf of this star belongs to P. 

Let us explain what the algorithm FindStrong-M-Sequence-Main is doing. We commented on 
it before when we were talking about techniques used in the algorithm, but now we will be more 
precise. The input to the algorithm is a long m-sequence. The algorithm tries to reconstruct a 
constellation H in the given tournament. It wants to achieve it by selecting \Pg(H)\ transitive 
chunks, where 9 is a constellation ordering of Ft, mapping each vertex of the constellation to one 
of the selected \Pg(F[)\ chunks and looking for it in the chunk it was mapped to (if several vertices 
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are mapped to the same chunk then the algorithm tries to find them in different subchunks in the 
subdivision of the given chunk). Each transitive element of the given m-sequence is a vertex of the 
undirected graph G that encodes the relation between different transitive chunks. An edge between 
two nodes in G indicates that throughout the execution of the algorithm a particular behaviour 
between corresponding transitive subtournaments was detected, namely one was detected to be 
adjacent to the other one. This type of relation is particularly precious since both transitive chunks 
were previously extracted from different linear sets so there was no reason to assume before that the 
relation was true (it will ultimately enable us to look for different leaves of the same star in different 
transitive elements of the m-sequence when we will look for H later using different approach). Graph 
G evolves during the execution of the algorithm as new pairs of transitive chunks such that one 
is adjacent to the other one are detected (the evolution is conducted by adding new edges as well 
as replacing with new transitive chunks the old ones in the vertex set V(G)). The algorithm tries 
to reconstruct H in the tournament induced by selected \Pg(H)\ transitive chunks star by star. 
Whenever vertices inducing a particular star are found we say that state 1 was reached. Since 
the reconstruction of the entire H cannot succeed (input tournament is H- free) at some point, by 
simple analysis based on the Pigeonhole Principle, the algorithm detects two substantial transitive 
subchunks, such that one is adjacent to the other one. Since the initial \Pg(H)\ transitive chunks 
were chosen as an independent set in G, when we replace in G the original two transitive chunks 
by the two found transitive subchunks, we also need to add one more edge. So from the point of 
view of graph G in each step we are taking its independent set of size \Pg(H)\ and we add an edge 
between some two vertices of this set. Since the graph has 2 h+1 nodes (and this number is the same 
throughout the execution of the algorithm), at every single step it has a clique or an independent 
set of size h. If an independent set of size h does not exist then we take an /r-clique and it it easy 
to see that it corresponds to the strong m-sequence (every edge of the clique indicates the relation: 
adjacent to between corresponding transitive subsets). Simple calculations lead to the conclusion 
that the constructed strong m-sequence has desired characteristic (in terms of size of its elements, 
etc). If an independent set is found then new edge is added. Now notice that since whenever there 
exists an independent set of size h an edge is added, at some point the clique of size h will appear 
in G anyway. Thus we will be always able to construct a strong sequence we are looking for. 

Algorithm 4.8 (Algorithm FindStrong-M-Sequence constructing a strong m-sequence) 

• Input: A constellation H with V(H) = {1,2,..., h}, a constellation ordering 6 = (1, 2,..., h), 
an H-free tournament T and a (c, A, e) -m-sequence in T of length k = 2 h+2 (h + 1) + 2h + 1 
which is M-big for M > 2 (h + 2) • 2 2&h+2 . It is assumed that A < 2 $ h+6 . 

• Output: An X-strong (c, A, e)-m-sequence in T of length k which is ± h+3 M-big, where: 
c = 224 l +3 c, A = 2 2 ° h A, k = 2 h? + Ah + 1, I = {h + 1, 2h + 1,..., h 2 + 1}. 

• Description: Run algorithm MakeSmooth to get a smooth (c , X', e)-m-sequence (Cj,..., Cjf), 
where c = A = 4A k (in algorithm MakeSmooth we take f = 0, S{ = 1 for i = 1,2,..., A; 
and Sj = Ci). Let y = 2 h + 1. Denote transitive sets: C x+ \, C* 2 X + 2 , Cs x+ 3 ..., C' 2 m - i x+2 ' i + 1 as 
Ti, ..., T 2 h+i respectively. Then run algorithm FindStrong-M-Sequence-Main (see description 
below), where the arguments for FindStrong-M-Sequence-Main are defined as follows: 

— input m-sequence is a smooth (c , X' ,e)-m-sequence (Ci,..., C^) computed above, 

— input graph G is an undirected graph with V(G) = {Ti, ...,T 2 h+i} and no edges, 
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— an independent set S is of the form: S = {T\, ..., T/j}, 

— input parameter a is the set of all stars of H, 

— input parameter £ satisfies: £ = 2 (h+ 2 ) > 

and the coloring of vertices of the m-sequence is done as follows: 

— for every Ti — 1,2 color j for j = 1,2 ,...,h is assigned to the vertices of the 

indices: (j — 1)L^^J+1, •••> the transitive ordering of 7\ (we use the convention 

that the first vertex in the ordering has index 1 ), all other vertices of Tj are colored by 

h + 1, 

— vertices of all other sets of the m-sequence are colored by h + 1 . 

Algorithm 4.9 (Algorithm FindStrong-M-Sequence-Main) 


• Input: A constellation H with V(H) = {1,2, ...,h}, a constellation ordering 6 = (1,2, ...,h), 
an H-free tournament T and a smooth (c, A, e) -m-sequence (C \,..., C]f) in T of length k = 
2 h+2 (h + 1) + 2h + 1. Every vertex of the m-sequence is colored by a color from the set 
{1,2,..., h + 1}. For every color j £ {1,..., h + 1} and every i £ {1,..., k} if there are vertices 
in Ci colored by j then at least £|C'*| of them are colored by j. Furthermore, a nonempty 
subset a of the set of stars of H is given. We are also given an undirected graph G with 
V{G) = {C x +\, C 2 X + 2 , C 3 x+ 3 -") C 2 h+i x+ 2 h+if, where x = 2/i + l, and an independent set S of 
G denoted as S = {T tl ’ •••’ Tt h } for ti < t 2 < ... < t h . 

• Output: An X-strong m-sequence of length k = 2h? + 4/i +1, where X = {h + 1, 2h + 1,..., h 2 + 
!}• 

• Description: Take an arbitrary star X* £ o. Take a partitioning Pg(H) and let assume that 
it is of the form: Pg(H) = {Pi, ...,P Z }, where vertices of Pi are before vertices of Pj under an 
ordering 6 for i < j and z is the number of elements of the partition Pg{H) (see: definition 
of Pg(H)). We will assume that X* is a left star. For a right star the algorithm is completely 
analogous. Let n c be such that the center o/X* is in P nc under 6 . Denote by m c the position 
that this center occupies in Pn c under ordering 6 (first vertex of P Uc under ordering 6 occupies 
position 1, second - position 2 , etc.). Assume that X* has q leaves and that all the leaves are 
in Pm f or some ni (note that from the definition of the constellation, ni is the same for all 
leaves of X* and is different than n c ). Denote by mi for i = 1,2,..., q the position that I th leaf 
occupies in P ni under ordering 6 . First we check whether there exists a vertex p £ T tn that 
is colored by m c and vertices r\,...r q such that: 

— n £ t) , 

I'Tll 7 

— r'i is colored by m; for i = 1 , 2 ,..., q and 

— (r % , p) is a backward edge fori = 1 ,...,q . 

As we have already noticed, since H is a constellation we know that n c f ni. If vertices 
p,r\,...,r q exist we say that state 1 was reached. Otherwise we say that state 0 was reached. 

Assume first that state 0 was reached. But then, by Pigeonhole Principle, there exists a subset 
C C T of at least {IT, I vertices that are adjacent to all vertices of T, colored by some 

— b nc ° Q 1 b Tic 1 ° ° u 

fixed color i* £ {1,2,..., h}. Indeed, if state 0 was reached then we could not construct the 
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embedding of the left star defined above. So no matter which vertex v of T' tn is taken as the 
center, the construction is not possible. Fix such a vertex v. We try to find leaves of the star 
in differently colored chunks, i.e. find a backward edge from a colored chunk to v for every 
color. If this is not possible then v is adjacent to all vertices for some particidar color col v . 
This is the place where the Pigeonhole Principle comes into action. Since altogether we have 
q colors, for at least ~|T tn J vertices v from T tn ^ the color col v will be the same. In other 
words, at least ||T tn J vertices v from T tnc will be adjacent to all vertices of T t colored by 
some fixed color i* £ {1, 2, h}. 

In this scenario we replace T[ in the m-sequence by C and T[ by the subset C of T[ 

± cn c x ^ «/ b n ^ 

consisting of vertices colored by i*. In the undirected graph G we replace vertex Tj by 
C, vertex by C (keeping all edges of G, new vertices inherit edges adjacent to vertices 
that they replaced) and add an edge between vertex C and vertex C. Then we run on our 
updated m-sequence (which is not necessarily smooth) an algorithm MakeSmooth to make it 
smooth (with the same parameters as in the preprocessing phase of the algorithm FindStrong- 
M-Sequence). In G we replace all vertices by corresponding subsets extracted from them 
during smoothing-procedure (edges are inherited from the old graph G). Then we recolor all 
the vertices of the new m-sequence we obtained using the same coloring procedure that we 
used earlier in the algorithm FindStrong-M-Sequence before calling algorithm FindStrong-M- 
Sequence-Main for the first time. We replace our collection of stars by the collection of all 
stars of H which we call cjh■ We check whether there is a clique of size h in G. Assume 
first that there is not. Then, since G has 2 h+l vertices, it has an independent set S* of size 
h. We rerun algorithm FindStrong-M-Sequence-Main with updated parameters c,\,£,cr,S: 
c —> A —> f —> 2 (h+ 2 ) ’ a ° H ) S' —i► S* and updated graph G. Assume now that the 

clique of size h was found. Then note that we can easily extract from (C\, ..., Ck) an 1-strong 
subsequence of length k (this subsequence in particular contains all vertices of the clique). We 
output it. 

It remains to consider scenario when state 1 was reached. If this is the case we remove £* from 
a. Let X be the set of vertices from T tnc colored by m c and let Yi for i = 1,2, I be the set of 
vertices from colored by mi. Let Dij be a set of vertices from Q\(XUY’iU...UY^) colored by 

color j. Denote by nJ? 1 ’ 3 C D t j the subset of Dij consisting of vertices of Dij adjacent from p 
if p is before all vertices of Dij in the m-sequence and adjacent to p otherwise (the ordering in 
the m-sequence is induced by an ordering of sets Ci and a transitive ordering within transitive 
parts of the m-sequence). Similarly, denote by N^f’ 3 C Dij for u = 1,2, ...,q the subset of 
Dij consisting of vertices of D^j adjacent from r u if r u is before Dij in the m-sequence and 
adjacent to r u otherwise. Denote N Di ^ = N^ 1 ' 3 nNp''"- 1 , where: N^ r ’ J = (\=i 2 ■ We 

run algorithm MakeSmooth on the given m-sequence, where we have: S( = N Di ^, parameter 
f satisfies: (1 — /) = £(1 — ^r) and get a new smooth m-sequence. In this new m-sequence the 
coloring is inherited from the old one. Now we rerun algorithm FindStrong-M-Sequence-Main 
with updated parameters c, A.£: c — > ^(1 — A —> ^ ^|(1 — 4r), a —> a \ {£*}. 

5 Analysis of the algorithms 

In this section we formally prove correctness of the algorithm that colors H- free tournaments, where 

H is a constellation. As a corollary we prove Theorem 12.11 
We start with some introductory observations: 
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5.1 Let T be a tournament. Assume that for two disjoint subsets X,Y C V(T) the following holds: 
d(X,Y) > 1 — A for some A < 1. Assume that X\ C X, Y\ C Y, X\ > ci|X|, Y\ > c 2 |y| for some 
0 < ci, c 2 < 1. Then d{X 1 ,Y 1 ) > 1-^. 

Proof. Let e^x be the number of directed edges from Y to X and let ey^Xi be the number of 
directed edges from Y\ to X\. We have: ey.x = (1 — d(X,Y))\X\\Y\ < A|X||y|, since d(X,Y) > 
1—A. Similarly: ey 1; Xi = (1—d(Xi, Yi))|Xi||Yi|. Assume by contradiction that d(X±, Y\) < 1 — 
Then, since X\ > ci|X|, Y\ > c 2 |y|, we have: ey 1> x 1 > A|X||y|. Since ey,x > ey 1; Xi, we get: 
eyx > A|X||y|, contradiction. 1 

5.2 Assume that every subtournament T$ of a tournament T contains a transitive subtournament 
of order at least |Ts| e for some e > 0. Then x(T) < n 1 ~ e log(n). Besides if in every subtournament 
Ts ofT one may find a transitive subtournament of order at least |T,s| e in time 0(h(\Ts\)) for some 
nondecreasing function h, then the proper coloring of T using at most n 1 ~ e log(n) colors may be 
constructed in time 0(n 1_e log(n)/i(n) + n 2 log(n)). 

Proof. In the preprocessing phase we sort each adjacency list. This requires 0(n 2 log(n)) time. 
Find a transitive subtournament Tf of T with |Ti| > (^) e and delete it from T. To perform a 
deletion we first sort the vertices of the found tournament and this can be done in 0(nlog(n)) 
time. Then we get rid of all the adjacency lists that are related to the vertices from the found 
tournament. This can be done in 0(nlog(n)) time simply by going through each adjacency list and 
performing a binary search in the sorted sequence of the vertices from the transitive subtournament. 
Finally we delete vertices of the transitive subtournament from all remaining adjacency lists and 
this can be done in 0(n|Tj| log(n)) time. We keep finding transitive subtournaments of order at 
least (l|) e as long there are at least § vertices in the tournament. The total time spent for running 
this subprocedure is: 0(/r(n)n 1-e + nlog(n) + n|Ti| log(n)). When we reach the state with less than 
^ vertices remaining, we have found 0(n 1-e ) transitive subtournaments: Ti,T 2 , .... We then apply 
the same subprocedure on the remaining graph of less than § vertices. We stop when there are no 
vertices left and by that time we have partitioned tournament T into transitive subtournaments. If 
we denote by H (n) the number of the transitive subtournaments found then we have the following 
simple recurrence formula: H(n ) < (^) 1 ~ e +iL(^ : ), which immediately gives us: H(n ) <n 1-€ log(n). 
By coloring each transitive tournament with the same color and using different colors for different 
transitive subtournaments we get a proper coloring of T that uses at most n 1_e log(n) colors. If we 
denote by T(n) the total running time of the algorithm then the above observations (and simple 
calculations) give us the following formula: T(n) = 0(n 1-e log (n)h(n) + n 2 log(?r)). That completes 
the proof. 1 

The following theorem turns out to be very important to prove the correctness of our coloring 
algorithm. It also explains how a polynomial lowe bound on the size of the transitive subtournament 
can be obtained. 

5.3 Let be T be a tournament and let A\,T\ C V(T). Denote by T Al a tournament induced by 
A\. Assume that (AyTj) is (c, e c ) -saturated, c > 0, and T Al is e c -transitive, where e c = 

Assume furthermore that a transitive subtournament of T Al of size at least |Ai| £c might be found 
in time h(\A\\) for some function h. Then T is e c -transitive and its transitive subtournament of 
size at least \T\ €c might be found in time /i(|Ai|) + t m , where t m is the size of the largest transitive 
subtournament of T. 
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Proof. Denote n = |T|. Let T' be a transitive subtournament of order at least |Ai| £c , found in T Al . 
Note that if we merge it with a tournament induced by T\ then we get a transitive subtournament. 
It only suffices to prove now that this bigger transitive subtournament, denote it as T l , satisfies: 
\T l \ > n £c . We have: \T l \ > ( cn) £c + cn ec , since |Ai| > cn. Since e c = we () ^ )ta ' n: 1^1 — n £c . 

That completes the proof. | 

Now we prove correctness of the algorithm 14.41 and analyze its running time. Let us denote: 

A* = ( 47 ^ 3 k) Vh21 for i = 0,..., k. 


5.4 If \T\ > - - ^fhk 1 ^ 1 then algorithm \4-4\ constructs a (c, \)-l-sequence for c = 

- 

2 k is the length of the sequence and h is the size of the forbidden subtournament, 
polynomial time. 


\hk 

where 
and runs in 


Proof. Note first that A i < Jff A for i = 1,2,.... We call this property of the sequence {A*} for 
i = 0,1,... the a-property. The algorithm trivially works for k = 0 so we can assume from now 
on that k > 0. Let h = \H\ and n = \T\, where H is a forbidden tournament. Note that the 
algorithm stops when the subroutine 14.11 is called with h r = 1 or the algorithm 14.41 itself is called 
with k = 0. Note also that if the former holds then the last h calls in the recursive call-tree on the 
path ending at that call were the calls of the subroutine 14.11 But then we can take last h vertices 
v* found in h last calls of the subroutine ED and they induce a copy of H in T, contradiction. Let 
us explain in detail why this is the case. Take the first vertex v* from the sequence of h consecutive 
ones and call it rj. Notice that the remaining h — 1 consecutive calls will operate on the set of 
sets {Nj(u*)}. The remaining h — 1 vertices v* induce a copy of H\{v*} in such a way that in the 
embedding each vertex of H\{vl} resides in the different set Nj(vl). But then we can take this 
embedding of 14\{uJ}, add vertex uj and from the definition of the sets Nj(v*) we conclude that the 
constructed set of h vertices induces a copy of H. Thus if the algorithm stops then algorithm 14.41 
is recursively called with k = 0. Note also that the algorithm must stop since in the recursive 
call-tree there does not exist a path of h calls of the subroutine 14.11 (from what we have said so 
far) and whenever algorithm 14.41 is called parameter k is being decreased by 1. Note that, since 
n > ———, whenever algorithm 14.41 is recursively called, the H -free tournament it operates 

A fc n—n 

on is of size at least 2 (h + 1). To see this, consider one call of the algorithm 14.41 and let i be 
its second parameter. Note first that after one call of algorithm 14.41 and at most h consecutive 
calls of subroutine ED the iL-free tournament T 2 which is the last parameter of the next call of 
algorithm I4.4l is of size at least — = I?" 1 1 ( 7^1 — j^fry) where T 1 is the last parameter 

of the previous call of algorithm 14.41 Finally, note that on the path of the tree of recursive calls 
there are at most k consecutive calls of algorithm 14.41 and that Afc < A i for i = 0, l,...,k — 1. 
Assume now that in the subroutine El we reached the state when the two sets W, St » were found 
(see: description of the algorithm from the previous section). Assume without loss of generality 
that d(W, S tjt ) > 1 — Aj. Denote by uq the size of the tournament T r which is the parameter of 
the last call of algorithm 14.41 preceding in the call-tree the construction of W and St jt ■ Note that 
inductively sequences (A\,A 2 k- 1 ) and (A^,A' 2k _ 1 ) are both ( c(H,k — 1, A), A)-Lsequences. 
Note also that we have: |W|,|St.*| > s, where s = LfhJA^FD- (this comes from the previous 
observation that between two consecutive runs of algorithm 14.41 we have at most h recursive runs of 
the subroutine EE]). We have: s > — l)A.f 77 x 7 , thus s > w- 0 ( 77^7 ~ ■ Since at any point 

of the execution of the algorithm an iL-free tournament we are dealing with has size at least 2 (/i+l), 

we can conclude that \W\, \St jt , \ > /no, where / = 2 (h^_i) ~ Then, since A* < Xc z (H , i — 1, A) (which 


22 









follows from the a-property) and c(H,i, A) < fc(H,i — 1, A), using Theorem 15.11 we can deduce 
that the sequence {A\,A 2 k-i, A 1 ,A 2k _ x ) is a (c(H, k, A), A)-/-sequence. 

To prove that algorithm 14.41 runs in polynomial time, note that time spent by the algorithm 
between two its recursive consecutive runs on the path of recursive calls is polynomial. Therefore, 
if T(k) denotes time spent by the algorithm to find a /-sequence of length 2 fc , then we have: 
T(k) < poly(n) + 2 T(k — 1). Since A: is a constant, T(k) is clearly polynomial in n. 1 

Now we prove correctness of the algorithm 14.51 

5.5 Algorithm \f.5\ computes a clique {ui,..., Vk}- 

Proof. Assume first that at each stage of the algorithm sets AT are nonempty. We will prove 
it later. Note that from Theorem 15.11 we know that \Wi\ < Thus we have \W 2 U ... U Wk\ < 

IW 2 I U ... U \Wk\ < k^- < Thus a set V\\{W r 2 U ... U Wk) is nonempty so we can always find 
v \. It is obvious that if we combine the clique found in the next call of the algorithm 14.51 with 
vertex v\ then we get a clique. Note also that since |AT| > (1 — 2kX)\Vi\, by Theorem 15.11 we 
have: d(Vj , V)) > 1 — for 2 < i < j < k, so we update the parameters of the algorithm 

correctly. Notice that the first run of the algorithm 14.51 is for A < -^fe+r and tluat altogether there 
are exactly k calls of the algorithm, where in each call we update: A —>■ jjzz^kXf 1 ' Thus eac h time the 
algorithm 14.51 is called we have: A < ^. Therefore in particular, whenever sets AT are calculated 
they are always nonempty. That completes the proof of the correctness of the algorithm 14.51 9 

Now we prove correctness of the algorithm 14.61 and analyze its running time. 

5.6 Algorithm \4-6\ computes a (min(c, (§) e ), A, e)-m-sequence which is (log(cn) — 2 )-big and runs 
in time 0(h(A m )7i 1 ~ e ) + poly(n), where poly(n) is a polynomial factor. 

Proof. Denote n = |T|. Denote XP 2i = T{ U ... U T ' L r . for i = 1,2, Note that > XX for 
i = 1,2,..., k. Denote A' 2i+1 = A^+i and A' 2i = A 2i for i = 1, 2,..., k. Then, using Theorem 15.11 we 
can deduce that d(X i ,X-) > 1 — 4A for 1 < i < j < 2k + 1. Indeed, |A-| > ^ and \Aj\ > and 
furthermore: d(Ai,Aj) > 1 —A. Now take some A % and A - for 1 < i < j < 2k+ 1 and corresponding 
sets Vi and Vj in G. Note that since d(A' i ,A'j) > 1 — 4A, using Theorem 15.11 we can conclude that 
the number of edges going between Vi and Vj is at least (1 — Ao) | 111, where Ao = ■ 

Let us explain in detail why this is the case. Assume otherwise. Then there are at least Aq |A?; 11Aj | 
pairs of elements (x, y) from Vi and Vj that are not adjacent in G (notice that each x and y is a 
subset). Each element from x has the same size and each element y has the same size (even though 
the size of x does not have to be the same as the size of y). Denote the size of each element of V) 
by si and the size of each element of Vj by S 2 . Then we can conclude that there are more than 
Ao I Vi | \ Vj |siS 2 A edges going from A- to A i (since by definition of the nonedge of G , each nonedge 
(x,y) introduces at least A|x||y| edges from X- to A'f). Notice that the size of X i is |Vj |si and the 
size of A'j is | V 2 1« 2 - Thus there are more than AoA|A^||A^| edges going from X- to X % . On the other 
hand, since d(A i ,Aj) > 1 — 4A, we know that the number of edges going from A - to A- is at most 
4A|A-||A^| < AqA|A-||A^- 1, where the last inequality follows from the fact that (by assumptions of 
the theorem) A < Aq j- We get a contradiction. Thus indeed the number of edges going between V) 
and Vj is at least (1 — Ao)|Vj||I^j. But then we see that all conditions necessary to run algorithm 14.51 
are satisfied. The parameter k in the statement of the algorithm 14.51 correspond to 2k + 1 in our 
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setting since our m-sequence is of length 2k+ 1. Thus Ao from our setting corresponds to the upper 
bound on A from the statement of algorithm 14.51 

Note also that a clique found by this algorithm in G corresponds to the (c , A, e)-m-sequence for 
c = min(c, (§) e ). This comes from the fact that each T'- satisfies |Tjj > (§) e n e . This m-sequence 
is (log(cn) — 2)-big since each extracted T- satisfied: |Tjj > log(cn) — 2. This is the case since 
tournaments Tj are extracted from tournaments of size at least s > ^ and by classic Ramsey 
argument, each such tournament has a transitive subtournament of order at least log(s) — 1. 

Ley us analyze now the running time of the algorithm. Note first that if W is an IV-vertex 
tournament then its transitive subtournament of size at least log(IV) — 1 may be found as follows: 
Take an arbitrary vertex v\ £ V(W). Let N~ be the set of its inneighbors in W and let be the 
set of its outneighbors in W. If N~ > then let W\ = W\N.~, otherwise let W\ = W\. Now 
consider a tournament W\ and repeat the procedure by taking an arbitrary vertex V 2 £ V(W\) 
and considering sets of its inneighbors and outneighbors in Wi, etc. Using this procedure we get 
the sequence of vertices: vi,...,v r for some r. It is easy to see that r > log(IV) — 1 and that 
{t>i, ...,v r } is a transitive subset. Trivial implementation of this algorithm clearly runs in poly(N) 
time. We use the procedure we have just described to find transitive subtournament of size at least 
log(^) — 1 in algorithm 14.61 Note that the only possibly nonpolynomial part of the running time 
corresponds to extracting transitive subtournaments with the use of procedure h. Fix some Aj. 
Each transitive subset extracted from A, is of size at least (^^) e . Thus the number of extracted 
transitive subsets from any given Ai is 0(|Aj| 1_e ) = 0(n 1-e ). Extracting a transitive subset requires 
time 0(h{\Ai\)) = 0(h(A m )). Finally note that we have a fixed number of sets A,. That completes 
the analysis of the running time of the algorithm. 1 

Now we prove correctness of the algorithm 14.71 

5.7 Algorithm computes a smooth (c ,A , e) -m-sequence, a smooth (c , A ,e)-t-sequence or a 
smooth (c , A )-l-sequence (C^,..., C k ) respectively, where c = §(1 — f)> ^ = (l-f) 2 > T = s i + --- + s fe 

and C[ C Uj=i 2 si f or ^ = 1, 2, ..., k. Besides we have: \C[ n S] \ > ^(1 — /) for i = 1, 2, ..., k, 
j = 1 , 2 ,..., Si. 


Proof. We assume without loss of generality that a (c, A, e)-m-sequence is given in the input. 
Let T be a tournament with the (c, A, e)-m-sequence and denote n = |T|. Note first that from 
Theorem 15.II we know that d(S t i 1 , Sj 2 ) > 1 — Using Theorem 15.II again we can conclude that 

| Sj 1 \Cj’f 2 1 < . To see that assume without loss of generality that i < j (for i > j the analysis is 

exactly the same). Note first that d(S 1 : 1 , Sj 2 ) > 1 — Thus the number of directed edges from 

Sj 2 to Sj 1 is at most n |5* 1 ||5 I * 2 |. However if |*S , * 1 \C/’/ i 2 | > then the number of vertices of 

S'* 1 that are adjacent to less than (1 — ^ ] 2L ^ )|S* 2 | vertices of S* 2 is more than which clearly 

indicates that the number of directed edges from S* 2 to S* 1 is more than I 11 • This is a 

contradiction according to what we have noted before. 

Therefore we have: |C<*| = . 2 < 110tice 

that the number of terms in the sum from the last sequence of inequalities is at most L ). Now 

take a vertex v £ C z,tl and take some j > i, t 2 £ {1, 2 ,..., sy} - Note that there are at least 

| 5 * 2 | 

(1 - vertices in |S* 2 | adjacent from v. Since \C j ’ t2 \ > we can conclude that 


_ \ | 

d({v},Ci ,t2 ) > 1 — (1 ~^ 2 | 3 Thus d({v},Ci ,t2 ) > 1 — Similar analysis can be done for 


\s. | 

' _ 3 _ 1 
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j < i, t 2 £ {1, 2, j}. Now note that | C[ fl Sj\ > ^ p cn . That completes the proof. | 

Now we prove correctness of the algorithm 14.81 

5.8 Algorithm \4-8\ computes an 1-strong (c, A, e)-m-sequence of length k! which is 4 h +3 M-big, 
where c = 224 l +3 c, A = 2 2 " h A, 1 = {h + 1, 2h + 1,..., h 2 + 1} and k' = 2hi 2 + Ah + 1, assuming that 
A<^ andM>2(h + 2)-2 28h+ \ 

Proof. Note that from the definition of a graph G and the way it is updated it is clear that when 
the algorithm stops it returns an Z-strong m-sequence. Indeed, every clique {Cq, Ci 2 ,..., C \ h } in G 
satisfies: d(C tji , ) = 1 for 1 < j\ < j 2 < h. Now note that state 0 may be achieved throughout 

the execution of the algorithm at most (1 — j^i) + 1 times, where |P(G)| = 2 h+l . This 

is true since when state 0 is achieved a new edge is added to G. Graph G cannot have more 
than ^ ^ (1 — + 1 edges since, by Turan’s Theorem, if G has \ v ^)\ (i _ _i_) _|_ \ edges 

then it has a clique of size h. Note also that state 1 may be achieved at most h — 1 times in a 
row since otherwise we can merge stars found at each of the consecutive stages when state 1 was 
achieved to reconstruct H. That contradicts the fact that T is i7-free. We can conclude that the 
algorithm stops. It only suffices to show that all its parameters are correctly updated. This comes 
directly from algorithm 14.71 that was analyzed before, Theorem 15.11 and the fact that at every stage 
of the algorithm each transitive set of the m-sequence has at least 2{h + 2) vertices (thus every 
expression of the form L^i J > where T is a transitive part of the ?n-sequence, may be bounded 

from below by 2 (fe+i) )• We call this last property an fl-property and will prove it later. Knowing 
that, we are ready to analyze in more detail the updates when states: 0 and 1 are reached. To see 
that whenever state 0 is reached all parameters are correctly updated, notice that while replacing 
elements of the m-sequence we decrease the size of each set of the sequence at most ^ times. Now 
assume that state 1 was reached. Note that each D t .j satisfies: \D tJ \ > £|Ci|. Since the m-sequence 
is smooth.bv Theorem lSJl we have: d(p,Dij) > 1 — j. Thus we have |iVp l,j \ > c£(l — ^). Similarly: 
|IV£| > c£(l — j) for i = 1 Therefore we have: | N Di ^\ > c£(l — ^). But then we can run 

algorithm 14.71 with (1 — /) = £(1 — ^). That observations enables us to finish the analysis of the 
parameters’ updates when state 1 is reached. 

It remains to prove an fl-property. The fact that at every stage of the algorithm each transitive 
set of the m-sequence has at least 2{h + 2) vertices is implied by our next remark. One can notice 
that under our choice of the initial values of parameters c, A we have: ^ < \ and £ > 
at every stage of the algorithm. Note that when state 0 is reached and we update the parameters, 
we have: £ = 2 (h+ 2 ) • On the other hand, since ^ when stage 1 is reached and we update 

the parameters, we have: f new > where £ ne w is the value of £ after the update and is the 
one before the update. Thus, since state 1 may be achieved at most h — 1 times in a row, we get: 
£ 9 2 h--iq t+ 9 ) a t every stage of the execution of the algorithm. Now, from what we have said so 

far, we can conclude that whenever state 0 is achieved we have: X new < Ah 2 k{h + 2) 2 2 4/i ~ 2 A 0 m and 
whenever state 1 is achieved we have: \ new < 16k(h + l)(h + 2) 2 2 4/l ~ 2 A 0 z ( i, where: X new is the value 
of A after the update and A m is the one before the update. Thus at every stage of the algorithm 
we also have: A < 2 2 fa (q, +2 ) ^imt, where A i n u is value of the parameter A at the very beginning of 
the algorithm. Whenever state 0 is achieved we also have: c new > 2 ^h.(fc+ 9 ) an< ^ whenever state 1 
is achieved we have: c new > 2 ^ h f °^ +2 ) > w l iere: °new is the value of A after the update and c 0 id is 
the one before the update. Now note that k < 2 2h+3 . Notice that from what we have said before 
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we know that state 0 is achieved at most 2 2h+2 times during execution of the algorithm 14.81 and 
state 1 is achieved at most ( h — \)2 2h+2 times. All these observations and some simple calculations 
imply that at every stage of the algorithm each transitive set of the m-sequence is indeed of size at 
least 2 (h + 2). Therefore ^-property is satisfied. 

Thus an I-strong m-sequence output by the algorithm is a (c, A, e)-m-sequence which is cM -big 
for parameters c, A defined in the algorithm. | 

Now we prove correctness of the algorithm 14.11 

5.9 Assume that we are given a m-sequence of the H-free tournament T with h = \H\. Assume 
furthermore that this sequence is an 1-strong (c, A, e)-m-sequence of length k = 2 h 2 + Ah + 1, where 

A < o2 s h -i } ■ n = \T\ > 2 , M > 2 2lhj2 , e = and c = Then algorithm ^. 1\ computes a 

transitive subtournament of T of order at least \T\ e . 

Proof. Our analysis is very similar to the one conducted in the proof of the correctness of the 
algorithm 14.81 Note that it cannot be the case that during the execution of the algorithm all the 
stars of H were found and a was empty at some point of the execution since after combining these 
stars one can reconstruct H in T which contradicts the fact that T is H- free. Note also that after 
our choice of initial values of the parameter A during the entire execution we have: ^ ^ and 

at every point of the execution of the algorithm each set of the m-sequence under consideration is 
of size at least 6 (this follows by simple but tedious calculations, similar to those presented in the 
analysis of algorithm 14.8j) . Therefore now we can repeat analysis of the algorithm 14.81 Thus we 
will not discuss details again. Note only that during the entire execution of the algorithm we have: 
£ > -^+ 2 - This is true since whenever we update parameter £ we have: f new > where £ neu , is 
a value of £ after the update and £ 0 ^ is the one before the update. Similarly, whenever we update 
parameter A we have: \ new < 4 2h+e k\ 0 id, where X new is a value of A after the update and A dd is 
the one before the update. And finally, whenever we update parameter c we have: c new > 
where c new is a value of c after the update and c Q id is the one after the update. Thus at every point 
of the execution of the algorithm we have: c > where: q, is the value of the parameter c at 
the very beginning of the algorithm. The only new part that we will focus on concerns running 
procedure V. Note that after running it we obtain an (^j-, e)-saturated pair. Thus it suffices to 
note that, because of Theorem 15.31 under our choice of e, constructed transitive subtournament is 
of order at least |T| e . | 

Let us prove now that the algorithm 14.31 is correct. We will also prove Theorem 12.11 


5.10 Let H be a constellation of order h and let T be an H-free tournament. Then the algo- 
rithm \4--i\ outputs a proper coloring of the vertices of H that uses at most |T| 2 250h +1 log(|T|) 

i 

colors. Besides the first color class it finds is a transitive tournament of order at least |T| 2 250h2+1 . 

Proof. Correctness of the algorithm is a simple consequence of the fact that algorithms: 

14.4114.6114.81 and 14. II are correct. In particular, under our initial choice of parameter A we have: c = 
2 $l h i during execution of the algorithm l4.ll Thus we can take as an e every positive value no greater 

than lo ^ c ^ . Note that in order to use algorithms: 14.4114.6114.81 and 14.II the conditions on the sizes 
of the elements of m/Lsequences given as an input need to be satisfied. One can check that all those 


conditions are satisfied whenever a tournament T we proceed with is of size at least: 2' 
And one can also easily note that for e\ = we trivially have: N ei < 2 for n < 2' 


,,10fe^+30h 

, 9 10fc 2 +30fi 
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2 10/i 2 +30h 

Thus every n- vertex tournament T with n = \T\ < 2 2 ~ contains a transitive subtournament 

of size at least |Ej ei . Therefore the EH coefficient of a constellation H is at least ——-. That, 
because of Theorem 15.21 completes the proof of the correctness of the algorithm 14.31 Let us 
analyze the running time of the algorithm. The only, possibly nonpolynomial factor comes from 
the execution of the algorithm 14.61 and from the execution of the algorithm 14.11 (to be more precise: 
from extracting transitive subtournaments by procedures: Sub and V). Thus let us take advantage 
of the analysis of the running time of the algorithm 14.61 and the algorithm 14.11 Denote by T (n) 
the running time of the algorithm 14.11 We have the following straightforward recursive formula: 
T(n) < 0(T(( 1 — c)n)n 1 ~ e ) + poly(n ), where poly(n) is a polynomial factor. Similarly, if S(n) is the 
running time of the algorithm 14.61 then we have: S(n ) < 0(S((1 — c)n)n l ~ e ) + poly(n). If we denote 
by G(n ) the running time of the algorithm 14.31 then we have: G{n ) = 0(n(T(n) + S(n))) + poly(n). 
Thus one can check that for C > 0 large enough function e < -' log ( n ) is an upper bound for T(n), 
S'(n) and G(n). | 


6 Further remarks 

In this section we discuss some applications of the techniques presented in the paper. 


6.1 The Erdos-Hajnal conjecture for constellations 


The presented algorithm gives a constructive proof of the theorem stating that every constellation 
satisfies the Erdos-Hajnal conjecture. The theorem was first proven by [4]. However that proof 
was not constructive. Besides, because it used the regularity lemma, it gave much weaker lower 
bounds on the EH coefficients of constellations. Our bound is also very small but needless to say, 
it is much bigger than the best lower bound that can be obtained with an approach that uses the 


regularity lemma. We prove that the EH coefficient of a constellation H is at least 
h = \H\. 


2 250h 2 + l ’ 


where 


6.2 The Erdos-Hajnal conjecture for nonprime tournaments 

We start with one more useful notation. For any tournament H with vertex set V ( H ) = {v \,..., Vh} 
and for any tournaments Ej,..., E), let H(F \,..., EX) denote the tournament obtained from H by 
replacing each Vi with a copy of Ej, and making a vertex of the copy of Ej outadjacent to a vertex 
of a copy of Ej, j ^ i, if and only if ( Vi,Vj) £ E(H). The copies of F t . i = 1,..., h, are assumed to 
be vertex disjoint. 

Let EE, F be tournaments satisfying the Erdos-Hajnal conjecture with some e{H),e{F) > 0. Let 
V(H) = {wi,..., Vh}- Denote by H(F, V 2 , Vh) the tournament obtained from Ef by replacing v\ 
by F. We say that a tournament H(F,i> 2 , ■■■,Vh) was obtained from H and F by a substitution 
procedure (we substitute v\ with F). Note that if |Ej > 1 and h > 1 then H(F, ,..., Vh) is not prime 
since V(F ) is a nontrivial homogeneous set. It was proven in [Ij that tournament H(F, V 2 , Vh) 
also satisfies the conjecture with e(H(F,V 2 , —,Vh)) = de{H) for every 6 < jnpyf ■ To be more 
precise, in [1] the analogous result for undirected graphs was proven and the proof of the directed 
version was not given explicitly. However a proof of the directed version is completely analogous to 
the one for the undirected version - cliques/stable sets are replaced by transitive subsets and induced 
subgraphs by subtournaments. Let T be some family of tournaments for which the conjecture is 
known and let T be the closure of T under taking substitutions. Then we can conclude that every 
member of F also satisfies the conjecture. Besides, if we can construct a polynomial-size transitive 
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subtournament for every member of T then we can also construct an algorithm that can do the 
same for every member of J~. It is so since the proof that substitutions preserve the Erdos-Hajnal 
property, as presented in [I], is constructive. Thus as a corollary of the algorithm presented in the 
previous section we obtain algorithms for coloring H -free tournaments with 0(n 1_e log(n)) colors, 
where H is taken from the closure C. of the family of constellations C. In fact, techniques used 
in the algorithm from the previous section may also be used for some tournaments that are not 
constellations (see: the next subsection) thus we obtain coloring algorithms for even larger classes 
of tournaments. We should note here that a straightforward algorithmic version of the proof that 
the substitution procedure preserves the Erdos-Hajnal conjecture has a sub-exponential running 
time e^ n ) for some constant 0 < k < 1 ) since it needs to examine all subsets of size |br <5 ~| of 
the n-element set (see: pQ, pages: 4-5). Therefore whenever we use an algorithmic version of the 
substitution procedure we get a coloring algorithm that runs in the sub-exponential time. Since 
the algorithm presented in the previous section required only quasi-polynomial time, the question 
arises whether it is possible to get an algorithmic version of the proof of the substitution procedure 
that also requires only quasi-polynomial time. It seems that the method used in the proof proposed 
in p]J cannot be easily modified to improve the running time. However a completely different proof 
may potentially have this property. Interestingly, the algorithm proposed in the previous section 
to color H-free tournaments, where H is a constellation, uses different techniques from those that 
were used in [I] to prove the mentioned property of the substitution procedure and that enabled 
us to obtain quasi-polynomial running time. An open question is whether this running time may 
be improved to polynomial. 

6.3 The Erdos-Hajnal conjecture for small tournaments 

It turns out that many tournaments may be obtained from constellations by the substitution 
procedure. In particular, this is true for all tournaments on 5 vertices expect for the tournament 
C 5 (see: Theorem ED. Thus, according to what we have said before, for all tournaments H on 
at most 5 vertices except the tournament C 5 we get an algorithm running on an arbitrary H -free 
tournament, finding its polynomial-size subtournament and coloring requiring only 0 (n 1_€ log(n)) 
colors. At the same time we get a constructive proof of the Erdos-Hajnal conjecture for those 
tournaments. In fact we can say even more. It is true (though we will not show it here) that a 
similar method that was used in the algorithm presented in this paper may be used along with 
the methods presented in [5] to give a quasi-polynomial time algorithm that colors every n-vertex 
Co-free tournament with O (n 1 ^^ 5 ) log(n)), where €( 65 ) can be exactly calculated and given in 
closed-form (again, the regularity lemma is not required). 

We do not present that algorithm in this paper because of length constraints. The idea behind 
the proof is that we can construct an arbitrary m-sequence in the same algorithmic way as we did 
in this paper for constellations. This is true since the construction of the m-sequence does not use 
the specific structure of the constellation. The only thing we need to know is that a tournament is 
defined by a forbidden pattern H. When we have the m-sequence we try to reconstruct C 5 using 
one specific ordering of its vertices under which the graph of backward edges is a tree. Since the 
input tournament is CVfree we wont be able to succeed. Then we show that we either get a linear 
set exactly adjacent to/from the big transitive chunk (as in the constellation proof) and that by 
induction immediately leads to the explicit bound on the size of the transitive subtournament or we 
obtain another graph of backward edges. The trick now is to show that this other graph of backward 
edges also corresponds to C 5 . That completes the proof. In general, whenever the nonconstructive 
proof is given, where the m-sequence is obtained at the very beginning and then some Pigeonhole 
Principle approach is used to get a linear set exactly adjacent to/from a big transitive chunk, our 
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algorithmic framework may be used. Since we do not use the regularity lemma, we do not rely 
on the bounds provided by this tool to obtain lower bounds on the sizes on the elements of the 
m-sequence. That leads to the explicit lower bounds on the EH coefficients. 

Let us also introduce few small tournaments that are not constellations but play important role 
in the research on the conjecture for small forbidden patterns. We have already introduced Cq - a 
unique tournament on 5 vertices for which every vertex has indegree 2. Let Tq be a tournament 
with V(Tq) = {1,2, ..., 6 } such that under ordering (1, 2, 6 ) of its vertices the only backward 
edges are: (4,1), (6, 3), (6,1), (5, 2). Let Tg 1 be a tournament with V(Tg) = {1,2, ...,6} such that 
under ordering (1, 2,..., 6 ) of its vertices the only backward edges are: (4,1), (5,1), (5, 2), ( 6 , 3). Let 
Tg be a tournament with ^(Tg 1 ) = { 1 , 2 , ..., 6 } such that under ordering ( 1 , 2 , ..., 6 ) of its vertices 
the only backward edges are: (1,3), (2,3), (2,4), ( 6 , 5). 

We just note that, as in the proof presented in [5], the algorithm for C 5 uses two orderings 
of the vertices of C 5 : ordering (1,2,3,4, 5), under which the set of backward edges is of the form 
{(4,1), (5, 2), (5,1)} (the so-called tree-ordering since the graph of backward edges is a tree) and or¬ 
dering (4,1,3, 5, 2) (the so-called cyclic ordering). Surprisingly, a very similar method may be used 
for tournament Tg 1 and tournament Tg. Note that both tournaments are prime. For tournament 
Tg 1 the two crucial orderings of vertices are: ( 1 , 2 ,3,..., 6 ), under which the set of backward edges is 
of the form {(4,1), (5,1), (5, 2), ( 6 ,3)} (so-called forest ordering) and ordering (2,4,1, 6 , 3,5), under 
which the set of backward edges is of the form {(5,1), (1, 2), (5, 2), (3,4)}. For a tournament T 6 2 the 
two crucial orderings of vertices are: ( 1 , 2 ,3,..., 6 ), under which the set of backward edges is of the 
form {(4,1), (5,2), (5,1), ( 6 , 3)} (so-called forest ordering) and ordering (5,2,4,1, 6 ,3), under which 
the set of backward edges is of the form {(3,4), (4, 5), (3, 5), (1, 2)}. Thus if T = C (J{C 5 , Tg 1 , T|} 
and F is the closure of F under substitutions, then there exists a sub-exponential algorithm that 
finds a polynomial- size transitive subtournament of a iL-free n- vertex tournament, where H £ F. 
Besides there exists a sub-exponential algorithm that colors any n-vertex iL-free tournament with 
0(n 1_e log(n)) colors, where H £ F■ 

Thus, using Theorem 16.11 we conclude that there exists a sub-exponential algorithm that colors 
every n-vertex H- free tournament with 0(n 1_e log(n)) colors, where H is an arbitrary tournament 
on at most 5 vertices or a tournament on 6 vertices different than Tq. For those tournaments 
H finding polynomial-size transitive subtournaments of H- free tournaments can be also done in 
sub-exponential time. 

It can be proven (jSj) that: 

6.1 Every tournament on at most 5 vertices is either isomorphic to C§ or is of the form H(F \,..., TX) 
for some constellation H with V(H) = {v\,...,Vh}, h > 1, and some constellations Ti,...,TX. Ev¬ 
ery tournament on at most 6 vertices is either isomorphic to Tq, Tg 1 , or T| or is of the form 
H(F \,..., Fff) for some tournament H with V(H) = {tq,..., i>h}, h > 1 and some tournaments 
F\; —,F h . 

The proof uses a brute-force method thus we skip it. 

Thus by using our techniques one can obtain a constructive proof of the Erdos-Hajnal conjecture 
for all tournaments on at most 5 vertices and all tournaments on 6 vertices but Tq. This result is 
interesting since in the undirected case there are still graphs on 5 vertices for which the conjecture 
is open. Furthermore, the conjecture is still open for all undirected graphs on 6 vertices that cannot 
be constructed from smaller graphs by the substitution procedure. 
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